(57) Abstract: The invention is directed to isolated polypeptides
bearing sequence homology to the Sp36 protein found in 
pneumococcal organisms, such as Streptococcus pneumoniae. 
Polynucleotides encoding such polypeptides are also disclosed. 
The invention also relates to antibodies specific for the disclosed 
polypeptides and to uses of such antibodies in the treatment 
of diseases caused by staphylococci as well as group A and B 
streptococci. In addition, the invention relates to the use of the 
disclosed polypeptides in compositions and as vaccines and for 
prophylactic uses such as in vaccination of animals, especially 
humans, against a wide variety of streptococcal, staphylococcal 
and other diseases.

HOMOLOGS OF A PNEUMOCOCCAL PROTEIN AND FRAGMENTS FOR VACCINES

This application claims the priority of U.S. Provisional Application 
60/150,750, filed August 25, 1999, the disclosure of which is hereby 
incorporated by reference in its entirety. 

FIELD OF THE INVENTION 

This invention relates generally to the field of bacterial antigens and 
their use, for example, as immunogenic agents in humans and animals to 
stimulate an immune response. More specifically, it relates to the
vaccination of mammalian species, especially humans, with one or more 
polypeptides derived from gram positive bacteria and which show sequence 
homology with an immunogenic polypeptide obtained from Streptococcus 
pneumoniae. 

BACKGROUND OF THE INVENTION 

Polypeptides derived from gram positive bacteria are useful for
stimulating production of antibodies that protect the vaccine recipient
against infection by a wide range of serotypes of pathogenic gram positive
bacteria, including S. pneumoniae. Further, the invention relates to
antibodies against such polypeptides that are useful in the diagnosis and
passive immune therapy of such pneumococcal infections.

The genus Streptococcus contains a variety of species responsible
for causing disease in mammals, including humans, while also 
encompassing species that constitute normal flora in humans and other 
mammals. Among the bacterial species implicated in the etiology of
diseases in humans are S. pyogenes (part of the group A streptococcal
bacteria, herein designated "GAS" for "group A streptococci"), S. 
pneumoniae (referred to as "pneumococcus") and S. agalactiae (the group B 
streptococci or "GBS"). The group A streptococci cause serious diseases 
such as necrotizing fasciitis, scarlet fever and sepsis, as well as less virulent
diseases such as impetigo and pharyngitis. The pneumococci are the most
common cause of community-acquired pneumonia and are also responsible 
for more than half of all cases of otitis media in children. The pneumococci 
are also the second most common pathogen associated with bacterial 
meningitis. The group B streptococci are the most prevalent pathogen
associated with illness and death among newborns in the United States.

Currently, there are no vaccines available for the prevention of
diseases caused by the group A and group B streptococci, and presently
available pneumococcal vaccines are not effective in children under 2 years
of age or in the elderly because of the poor immunogenicity of the capsular
carbohydrates that compose the current vaccine. It would therefore be
highly advantageous to produce a vaccine that would prevent infection by
these classes of pathogen, especially in the age groups mentioned.

In addition to the pathogens just described, some bacteria of the
genus Staphylococcus are also of clinical importance. In fact, two of these
are among the leading causes of nosocomial infections (infections acquired 
while in the hospital). Both Staphylococcus aureus and Staphylococcus 
epidermidis readily colonize the skin of healthy individuals and can cause
acute disease in patients following immunosuppression or traumatic injury.
Infections caused by these species include bacteremia, endocarditis, 
osteomyelitis, wound infections and infections associated with indwelling
catheters. 

Streptococcus pneumoniae is a gram positive bacterium that is a
major causative agent in invasive infections in animals and humans, such as
the aforementioned sepsis, meningitis, and otitis media, as well as lobar 
pneumonia (Tuomanen, et al. New England J. of Medicine 322:1280-1284 
(1995)). As part of the infection process, pneumococci readily bind to
non-inflamed human epithelial cells of the upper and lower respiratory tract by
binding to eukaryotic carbohydrates in a lectin-like manner (Cundell et al.,
Micro. Path. 17:361-374 (1994)). Conversion to invasive pneumococcal 
infections for bound bacteria may involve the local generation of 
inflammatory factors which may activate the epithelial cells to change the 
number and type of receptors on their surface (Cundell, et al., Nature,
377:435-438 (1995)). Apparently, one such receptor, platelet activating
factor (PAF) is engaged by the pneumococcal bacteria and within a very 
short period of time (minutes) from the appearance of PAF, pneumococci 
exhibit strongly enhanced adherence and invasion of tissue. Certain soluble 
receptor analogs have been shown to prevent the progression of
pneumococcal infections (Idanpaan-Heikkila et al., J. Inf. Dis., 176:704-712
(1997)). A number of other proteins have been suggested as being 
involved in the pathogenicity of S. pneumoniae. 

Streptococcus pneumoniae itself has been shown to contain a gene
which encodes a protein designated herein as Sp36. This protein has a
predicted molecular mass of 91,538 Da and contains 5 histidine triad motifs
(proposed to be involved in metal binding). The gene encoding this protein
appears to be present in the 23 serotypes comprising the current commercially
available pneumococcal capsular vaccine. Immunization of mice with this
protein, in the presence of Freund's adjuvant, stimulates an immune
response which protects these mice from an intraperitoneal challenge with a
dose of virulent pneumococci that would normally kill the mice.

For the reasons stated above, there remains a need to identify
polypeptides having epitopes in common not only among various strains
of S. pneumoniae but also across a broader spectrum of gram positive
bacteria, so that such polypeptides can be used as vaccines to provide
protection against a wide variety of infectious organisms.

BRIEF SUMMARY OF THE INVENTION

In accordance with the present invention, there are provided vaccines
that include polypeptides obtained from gram positive bacteria other than S.
pneumoniae, as well as variants of said polypeptides and active fragments
of such polypeptides.

The present invention is also directed to novel genes, and the 
polypeptides encoded thereby, derived from gram positive bacteria other 
than S. pneumoniae, and which bear sequence homology to the Sp36 gene 
already described. Such gram positive bacteria include the group A and B
streptococci, as described herein, as well as species of the genus 
Staphylococcus, especially S. aureus. 

In a particular embodiment, the present invention is directed to 
specific gene sequences, and proteins encoded thereby, derived from the
group A and group B streptococci, and to the use of such expressed 
polypeptides and proteins as the basis for pharmaceutical compositions 
useful as vaccines and as a means for enabling isolation of antibodies with 
therapeutic and/or prophylactic activity (such as would be useful in 
preparing products like CytoGam).

In a further embodiment, the present invention also relates to the
preparation and use of fragments of the novel polypeptides disclosed herein, 
such fragments being immunogenic in nature and being useful in the 
preparation of vaccines against diseases caused by the pathogens from 
which such polypeptides, and fragments thereof, are derived.



BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 shows the results of a Southern blot of genomic DNA
from S. aureus, S. pyogenes, and pneumococcus. The DNA was digested
with restriction nucleases BamHI or PvuII and, after electrophoresis and
transfer to a nylon membrane, was probed with a labeled DNA fragment
encompassing the pneumococcal gene encoding Sp36. The hybridization
and washes were carried out under low stringency conditions. The results
show hybridization by the labeled probe to an S. aureus fragment in both
the BamHI and PvuII lanes and to two fragments in the PvuII digests of
two strains of S. pyogenes.

Figure 2 shows an alignment between the Sp36 amino acid
sequence from S. pneumoniae strain N4 and the homologous sequences
from S. pyogenes and S. agalactiae. Amino acids identical to those of the 
polypeptide from S. pneumoniae are boxed. 

Figure 3 shows the results of a Southern blot of genomic DNA from 
S. pyogenes, S. agalactiae, and S. pneumoniae probed with DNA
encoding the full length Sp36 homolog from S. pyogenes. The 
hybridization was carried out under low stringency conditions. These
results demonstrate that the S. pyogenes Sp36 homolog, used as a 
probe, is capable of detecting a homologous gene in S. agalactiae and 
pneumococcus. 

Figure 4 shows the results of a western blot using rabbit polyclonal
antiserum generated against recombinant Sp36 protein cloned from S.
pneumoniae strain Norway 4. The results demonstrate that this antiserum
reacts not only with the protein against which it was raised (here, Sp36)
and with a protein of similar size in a lysate of a serotype 6B strain of
pneumococcus, but also with a recombinant protein encoded by
the Sp36 homolog gene of group B streptococci.

Figure 5 shows the amino acid sequence for the GAS36 homologs 
with the histidine triad regions underlined (Fig. 5(a) and (b)) and the
sequence for a GBS36 homolog (Fig. 5(c)) with its histidine triad regions 
underlined. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to novel polynucleotides and 
polypeptides derived from species of gram positive bacteria, especially 
group A and B streptococci, and including the genus Staphylococcus, most
especially S. pyogenes (GAS), S. agalactiae (GBS), and S. aureus, 
respectively. 

Further, the present invention is directed to polynucleotides derived
from gram positive bacteria and which are at least partially homologous to
the polynucleotides making up the previously disclosed
Sp36 gene of S. pneumoniae (U.S. Application Serial No. 60/113,048).

The present invention is also directed to polypeptides, and
immunologically active fragments, segments, or portions thereof, which
polypeptides are encoded by the polynucleotides disclosed herein.

The present invention also relates to such polynucleotides and 
polypeptides in enriched, preferably isolated, or even purified, form. 

In accordance with the present invention, the term "DNA 
segment" refers to a DNA polymer, in the form of a separate fragment or 
as a component of a larger DNA construct, which has been derived from 
DNA isolated at least once in substantially pure form, i.e., free of
contaminating endogenous materials and in a quantity or concentration
enabling identification, manipulation, and recovery of the segment and its 
component nucleotide sequences by standard biochemical methods, for 
example, using a cloning vector. Such segments are provided in the form 
of an open reading frame uninterrupted by internal nontranslated
sequences, or introns, which are typically present in eukaryotic genes.
Sequences of non-translated DNA may be present downstream from the 
open reading frame, where they do not interfere with manipulation or 
expression of the coding regions. 

The nucleic acids and polypeptide expression products disclosed
according to the present invention, as well as expression vectors
containing such nucleic acids and/or such polypeptides, may be in 
"enriched form." As used herein, the term "enriched" means that the 
concentration of the material is at least about 2, 5, 10, 100, or 1000
times its natural concentration (for example), advantageously 0.01%, by
weight, preferably at least about 0.1% by weight. Enriched preparations
of about 0.5%, 1%, 5%, 10%, and 20% by weight are also
contemplated. The sequences, constructs, vectors, clones, and other 
materials comprising the present invention can advantageously be in 
enriched or isolated form. 

"Isolated" in the context of the present invention with respect to 
polypeptides (or polynucleotides) means that the material is removed from 
its original environment (e.g., the natural environment if it is naturally 
occurring). For example, a naturally-occurring polynucleotide or polypeptide
present in a living organism is not isolated, but the same polynucleotide or
polypeptide, separated from some or all of the co-existing materials in the 
natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, 
and still be isolated in that such vector or composition is not part of its
natural environment. The polypeptides and polynucleotides of the present
invention are preferably provided in an isolated form, and most preferably 
are purified to homogeneity. 

The polynucleotides, and recombinant or immunogenic polypeptides, 
disclosed in accordance with the present invention may also be in
"purified" form. The term "purified" does not require absolute purity; 
rather, it is intended as a relative definition, and can include preparations 
that are highly purified or preparations that are only partially purified, as 
those terms are understood by those of skill in the relevant art. For
example, individual clones isolated from a cDNA library have been
conventionally purified to electrophoretic homogeneity. Purification of 
starting material or natural material to at least one order of magnitude, 
preferably two or three orders, and more preferably four or five orders of 
magnitude is expressly contemplated. Furthermore, claimed polypeptides
having a purity of preferably 0.001%, or at least 0.01% or 0.1%, and
even 1% by weight or greater, are expressly contemplated.

The term "coding region" refers to that portion of a gene which either
naturally or normally codes for the expression product of that gene in its 
natural genomic environment, i.e., the region coding in vivo for the native 
expression product of the gene. The coding region can be from a normal,
mutated or altered gene, or can even be from a DNA sequence, or gene, 
wholly synthesized in the laboratory using methods well known to those 
of skill in the art of DNA synthesis. 

In accordance with the present invention, the term "nucleotide
sequence" refers to a heteropolymer of deoxyribonucleotides. Generally,
DNA segments encoding the proteins provided by this invention are 
assembled from cDNA fragments and short oligonucleotide linkers, or 
from a series of oligonucleotides, to provide a synthetic gene which is
capable of being expressed in a recombinant transcriptional unit
comprising regulatory elements derived from a microbial or viral operon. 

The term "expression product" means that polypeptide or protein that 
is the natural translation product of the gene and any nucleic acid 
sequence coding equivalents resulting from genetic code degeneracy and
thus coding for the same amino acid(s). 

The term "fragment," when referring to a coding sequence, means a 
portion of DNA comprising less than the complete coding region whose 
expression product retains essentially the same biological function or
activity as the expression product of the complete coding region. 

The term "primer" means a short nucleic acid sequence that is 
paired with one strand of DNA and provides a free 3'OH end at which a 
DNA polymerase starts synthesis of a deoxyribonucleotide chain.

The term "promoter" means a region of DNA involved in binding of
RNA polymerase to initiate transcription. 

The term "open reading frame (ORF)" means a series of triplets 
coding for amino acids without any termination codons and is a sequence
(potentially) translatable into protein. 

As used herein, reference to a DNA sequence includes both single 
stranded and double stranded DNA. Thus, the specific sequence, unless 
the context indicates otherwise, refers to the single strand DNA of such
sequence, the duplex of such sequence with its complement (double 
stranded DNA) and the complement of such sequence. 
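
As a minimal sketch of the three forms just enumerated (the single strand, its complement, and the duplex of the two), the following Python fragment may be considered. The function names and the convention of writing the complement in the 5' to 3' direction are assumptions of this example only.

```python
# Illustrative sketch only: function names are hypothetical.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement_strand(seq):
    """Return the complementary strand, written 5'->3' (reverse complement)."""
    return seq.translate(COMPLEMENT)[::-1]


def forms_of(seq):
    """Return the three forms a recited DNA sequence is taken to cover."""
    comp = complement_strand(seq)
    return {
        "single_strand": seq,
        "complement": comp,
        "duplex": (seq, comp),  # the two strands of the double stranded DNA
    }
```

For example, forms_of("ATGC")["complement"] evaluates to "GCAT".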

In accordance with the present invention, the term "percent identity" 
or "percent identical," when referring to a sequence, means that a sequence
is compared to a claimed or described sequence after alignment of the 
sequence to be compared (the "Compared Sequence") with the described or 
claimed sequence (the "Reference Sequence"). The Percent Identity is then 
determined according to the following formula: 

Percent Identity = 100 [1-(C/R)]

wherein C is the number of differences between the Reference Sequence 
and the Compared Sequence over the length of alignment between the
Reference Sequence and the Compared Sequence wherein (i) each base or 
amino acid in the Reference Sequence that does not have a corresponding 
aligned base or amino acid in the Compared Sequence and (ii) each gap in 
the Reference Sequence and (iii) each aligned base or amino acid in the 
Reference Sequence that is different from an aligned base or amino acid in
the Compared Sequence, constitutes a difference; and R is the number of 
bases or amino acids in the Reference Sequence over the length of the
alignment with the Compared Sequence with any gap created in the 
Reference Sequence also being counted as a base or amino acid. 

If an alignment exists between the Compared Sequence and the
Reference Sequence for which the percent identity as calculated above is
about equal to or greater than a specified minimum Percent Identity then the 
Compared Sequence has the specified minimum percent identity to the 
Reference Sequence even though alignments may exist in which the
hereinabove calculated Percent Identity is less than the specified Percent
Identity. 
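
The Percent Identity calculation described above can be sketched as follows, for illustration only. The sketch assumes that the two sequences have already been aligned by some other means and are supplied as equal-length strings in which a gap is written as "-"; the function names and the gap character are assumptions of this example. The phrase "about equal to or greater than" is approximated here by a simple greater-than-or-equal comparison.

```python
# Illustrative sketch only: input format and names are assumptions.
def percent_identity(reference_aligned, compared_aligned, gap="-"):
    """Compute 100 * [1 - (C / R)] for one pre-computed alignment."""
    if len(reference_aligned) != len(compared_aligned):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (r, c)
        for r, c in zip(reference_aligned.upper(), compared_aligned.upper())
        if not (r == gap and c == gap)  # ignore columns that are all gap
    ]
    if not columns:
        raise ValueError("alignment is empty")
    # R: positions of the Reference Sequence over the alignment,
    # with each gap in the Reference Sequence also counted.
    R = len(columns)
    # C: (i) reference residue with no aligned partner, (ii) gap in the
    # reference, (iii) aligned residues that differ.
    C = sum(1 for r, c in columns if r == gap or c == gap or r != c)
    return 100.0 * (1 - C / R)


def has_minimum_identity(alignments, minimum):
    """True if at least one alignment reaches the specified minimum."""
    return any(percent_identity(r, c) >= minimum for r, c in alignments)
```

For example, aligning the reference "ACGT-A" with "ACCTTA" gives one mismatch and one gap in the reference, so C = 2 and R = 6, and the Percent Identity is 100 [1-(2/6)], or about 66.7.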

Thus, the present invention is directed to novel, isolated polypeptides 
comprising an amino acid sequence at least 75% identical to a sequence in 
SEQ ID NO: 2, 4 or 6, preferably polypeptides at least 90% identical
thereto, more preferably 95% identical to the sequence of SEQ ID NO: 2 or 
4, and most preferably having the sequence of either SEQ ID NO: 2 or 4. 

The isolated polypeptides of the present invention may be found in a 
wide variety of microorganisms, but will commonly be found in an organism
selected from the group consisting of group A streptococci, group B 
streptococci, and Staphylococcus aureus, and wherein the group A 
streptococcal organism is Streptococcus pyogenes and the group B 
streptococcal organism is Streptococcus agalactiae. Also, polypeptides of 
the invention include, but are in no way limited to, isolated polypeptides
having a sequence at least 25% identical to the amino acid sequence of the 
Sp36 protein of Streptococcus pneumoniae. 

The present invention further relates to immunogenically active 
fragments of the isolated polypeptides disclosed herein.

The terms "fragment," "derivative" and "analog" when referring to
the polypeptides disclosed herein mean a polypeptide which retains
essentially the same biological function or activity as such polypeptide. 
Thus, an analog includes a proprotein, or preprotein, which can be
activated by cleavage of the proprotein portion to produce an active mature
polypeptide. Such fragments, derivatives and analogs must have sufficient 
similarity to the polypeptide of SEQ ID NO:2, 4 or 6 so that immunogenic 
activity of the native polypeptide is retained. 

The polypeptide of the present invention may be a recombinant
polypeptide, a natural polypeptide or a synthetic polypeptide, preferably a
recombinant polypeptide. 

The fragment, derivative or analog of the polypeptide of SEQ ID 
NO:2, 4, or 6 may be (i) one in which one or more of the amino acid
residues are substituted with a conserved or non-conserved amino acid
residue (preferably a conserved amino acid residue) and such substituted 
amino acid residue may or may not be one encoded by the genetic code, or 
(ii) one in which one or more of the amino acid residues includes a 
substituent group, or (iii) one in which the mature polypeptide is fused with
another compound, such as a compound to increase the half-life of the
polypeptide (for example, polyethylene glycol), or (iv) one in which
additional amino acids are fused to the mature polypeptide, such as a leader
or secretory sequence or a sequence which is employed for purification of 
the mature polypeptide or a proprotein sequence. Such fragments,
derivatives and analogs are deemed to be within the scope of those skilled
in the art from the teachings herein. 

As known in the art "similarity" between two polypeptides is 
determined by comparing the amino acid sequence and its conserved amino 
acid substitutes of one polypeptide to the sequence of a second
polypeptide.

Fragments or portions of the polypeptides of the present invention
may be employed for producing the corresponding full-length polypeptide by 
peptide synthesis; therefore, the fragments may be employed as 
intermediates for producing the full-length polypeptides. Fragments or
portions of the polynucleotides of the present invention may be used to 
synthesize full-length polynucleotides of the present invention. 

As used herein with reference to polypeptides, the terms "portion,"
"segment," and "fragment," refer also to a continuous sequence of
residues, such as amino acid residues, which sequence forms a subset of a 
larger sequence. For example, if a polypeptide were subjected to treatment 
with any of the common endopeptidases, such as trypsin, chymotrypsin, or 
papain, the oligopeptides resulting from such treatment would represent 
portions, segments or fragments of the starting polypeptide.
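
The endopeptidase example above can be illustrated with a short in silico digestion. The sketch below assumes the commonly used simplified rule for trypsin (cleavage on the carboxyl side of lysine or arginine, except where the next residue is proline); the function name and the input sequence in the usage note are hypothetical and are not taken from any sequence disclosed herein.

```python
# Illustrative sketch only: simplified trypsin rule, hypothetical names.
def trypsin_fragments(protein):
    """Split a one-letter amino acid sequence at simplified trypsin sites.

    Cleaves after K or R unless the following residue is P. Every returned
    oligopeptide is a continuous subsequence of the starting polypeptide.
    """
    fragments = []
    start = 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in ("K", "R") and next_residue != "P":
            fragments.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        fragments.append(protein[start:])  # C-terminal remainder
    return fragments
```

For the hypothetical input "MKAHHHRPGKLE", the function returns ["MK", "AHHHRPGK", "LE"]; each element is a portion, segment or fragment of the input in the sense used above.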

The present invention is also directed to isolated polynucleotides 
whose sequences contain coding regions encoding the polypeptides of the 
present invention, preferably the polypeptides of SEQ ID NO: 2, 4, and 6 
and most preferably will be the isolated polynucleotides comprising the
sequences of SEQ ID NOS: 1, 3, and 5. 

The present invention is also directed to fragments or portions of 
such sequences which contain at least 15 bases, preferably at least 30
bases, more preferably at least 50 bases and most preferably at least 80
bases, and to those sequences which are at least 60%, preferably at least 
80%, and most preferably at least 95%, especially 98%, identical 
thereto, and to DNA (or RNA) sequences encoding the same polypeptide 
as the sequences of SEQ ID NOS: 2, 4, and 6 including fragments and
portions thereof and, when derived from natural sources, includes alleles
thereof.

Yet another aspect of the present invention is directed to an
isolated DNA (or RNA) sequence or molecule comprising at least the 
coding region of a bacterial gene (or a DNA sequence encoding the same 
polypeptide as such coding region), in particular an expressed bacterial
gene, which bacterial gene comprises a DNA sequence homologous with, 
or contributing to, the sequence depicted in SEQ ID NOS: 1, 3, and 5 or
one at least 60%, preferably at least 80%, and most preferably at least 
95%, especially 98%, identical thereto, including 100% identity, as well 
as fragments or portions of the coding region which encode a polypeptide
having a similar function to the polypeptide encoded by said coding 
region. Thus, the isolated DNA (or RNA) sequence may include only the 
coding region of the expressed gene (or fragment or portion thereof as 
hereinabove indicated) or may further include all or a portion of the
non-coding DNA (or RNA) of the expressed bacterial gene.

In general, sequences homologous with and contributing to the 
sequences of SEQ ID NOS: 1, 3, and 5 (or one at least 60%, preferably at 
least 80%, and most preferably at least 95% identical or homologous 
thereto) are from the coding region of a bacterial gene.

The polynucleotides according to the present invention may also 
occur in the form of mixtures of polynucleotides hybridizable to some extent 
with the gene sequences containing any of the nucleotide sequences of 
SEQ ID NOS: 1, 3, and 5, including any and all fragments thereof, and
which polynucleotide mixtures may be composed of any number of such 
polynucleotides, or fragments thereof, including mixtures having at least 10, 
perhaps at least 30 such sequences, or fragments thereof. 

Fragments of the full length polynucleotide of the present invention
may be used as hybridization probes for a DNA library to isolate the full
length DNA and to isolate other DNAs which have a high sequence
similarity to the gene or similar biological activity. Probes of this type 
preferably have at least 15 bases, may have at least 30 bases and even 50
or more bases. The probe may also be used to identify a DNA clone
corresponding to a full length transcript and a genomic clone or clones that
contain the complete gene including regulatory and promoter regions. An
example of a screen comprises isolating the coding region of the gene by 
using the known DNA sequence to synthesize an oligonucleotide probe. 
Labeled oligonucleotides having a sequence complementary to that of the
gene of the present invention are used to screen a library of DNA or mRNA
to determine which members of the library the probe hybridizes to. 

The present invention is also directed to vectors comprising the 
polynucleotides disclosed herein, as well as to genetically engineered cells 
comprising such vectors and/or polynucleotides. Thus, the present
invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors 
of the invention and the production of polypeptides of the invention by 
recombinant techniques. 

Host cells are genetically engineered (transduced or transformed or 
transfected) with the vectors of this invention which may be, for example, a 
cloning vector or an expression vector. The vector may be, for example, in 
the form of a plasmid, a viral particle, a phage, etc. The engineered host 
cells can be cultured in conventional nutrient media modified as appropriate
for activating promoters, selecting transformants or amplifying the genes of 
the present invention. The culture conditions, such as temperature, pH and 
the like, are those previously used with the host cell selected for 
expression, and will be apparent to the ordinarily skilled artisan. 

The polynucleotides of the present invention may be employed for
producing polypeptides by recombinant techniques. Thus, for example, the 
polynucleotide may be included in any one of a variety of expression 
vectors for expressing a polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40;
bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived 
from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other vector 
may be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a 
variety of procedures. In general, the DNA sequence is inserted into an 
appropriate restriction endonuclease site(s) by procedures known in the art. 
Such procedures and others are deemed to be within the scope of those 
skilled in the art.

The DNA sequence in the expression vector is operatively linked to 
an appropriate expression control sequence(s) (promoter) to direct mRNA 
synthesis. As representative examples of such promoters, there may be
mentioned: LTR or SV40 promoter, the E. coli lac or trp, the phage lambda
PL promoter and other promoters known to control expression of genes in
prokaryotic or eukaryotic cells or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription 
terminator. The vector may also include appropriate sequences for
amplifying expression.

In addition, the expression vectors preferably contain one or more 
selectable marker genes to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture, or such as tetracycline or ampicillin
resistance in E. coli.

The vector containing the appropriate DNA sequence as hereinabove
described, as well as an appropriate promoter or control sequence, may be 
employed to transform an appropriate host to permit the host to express the 
protein.

As representative examples of appropriate hosts, there may be 
mentioned: bacterial cells, such as E. coli, Streptomyces, Salmonella
typhimurium; fungal cells, such as yeast; insect cells such as Drosophila S2
and Spodoptera Sf9; animal cells such as CHO, COS or Bowes melanoma;
adenoviruses; plant cells, etc. The selection of an appropriate host is
deemed to be within the scope of those skilled in the art from the teachings
herein. 

More particularly, the present invention also includes recombinant
constructs comprising one or more of the sequences as broadly described
above. The constructs comprise a vector, such as a plasmid or viral vector, 
into which a sequence of the invention has been inserted, in a forward or 
reverse orientation. In a preferred aspect of this embodiment, the construct
further comprises regulatory sequences, including, for example, a promoter,
operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially 
available. The following vectors are provided by way of example. Bacterial:
pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, phagescript, phiX174,
pBluescript SK, pBSKS, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene);
pTRC99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic:
pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV,
pMSG, pSVL (Pharmacia). However, any other plasmid or vector may be
used as long as it is replicable and viable in the host.

Promoter regions can be selected from any desired gene using CAT
(chloramphenicol acetyltransferase) vectors or other vectors with selectable
markers. Two appropriate vectors are pKK232-8 and pCM7. Particular
named bacterial promoters include lacI, lacZ, T3, T7, gpt, lambda PR, PL and
trp. Eukaryotic promoters include CMV immediate early, HSV thymidine
kinase, early and late SV40, LTRs from retrovirus, and mouse
metallothionein-I. Selection of the appropriate vector and promoter is well
within the level of ordinary skill in the art. 

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such
as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial
cell. Introduction of the construct into the host cell can be effected by
calcium phosphate transfection, DEAE-Dextran mediated transfection, or
electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular
Biology, (1986)).

The constructs in host cells can be used in a conventional manner to
produce the gene product encoded by the recombinant sequence.
Alternatively, the polypeptides of the invention can be synthetically
produced by conventional peptide synthesizers.

Mature proteins can be expressed in mammalian cells, yeast,
bacteria, or other cells under the control of appropriate promoters. Cell-free
translation systems can also be employed to produce such proteins using
RNAs derived from the DNA constructs of the present invention.
Appropriate cloning and expression vectors for use with prokaryotic and
eukaryotic hosts are described by Sambrook, et al., Molecular Cloning: A
Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), the
disclosure of which is hereby incorporated by reference.

Transcription of the DNA encoding the polypeptides of the present
invention by higher eukaryotes is increased by inserting an enhancer
sequence into the vector. Enhancers are cis-acting elements of DNA,
usually about 10 to 300 bp in length, that act on a promoter to increase its
transcription. Examples include the SV40 enhancer on the late side of the
replication origin (bp 100 to 270), a cytomegalovirus early promoter
enhancer, the polyoma enhancer on the late side of the replication origin,
and adenovirus enhancers.

Generally, recombinant expression vectors will include origins of
replication and selectable markers permitting transformation of the host cell,
e.g., the ampicillin resistance gene of E. coli and S. cerevisiae Trp1 gene,
and a promoter derived from a highly-expressed gene to direct transcription
of a downstream structural sequence. Such promoters can be derived from
operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase
(PGK), α-factor, acid phosphatase, or heat shock proteins, among others.
The heterologous structural sequence is assembled in appropriate phase
with translation initiation and termination sequences, and preferably, a
leader sequence capable of directing secretion of translated protein into the
periplasmic space or extracellular medium. Optionally, the heterologous
sequence can encode a fusion protein including an N-terminal identification
peptide imparting desired characteristics, e.g., stabilization or simplified
purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by
inserting a structural DNA sequence encoding a desired protein together
with suitable translation initiation and termination signals in operable reading
phase with a functional promoter. The vector will comprise one or more
phenotypic selectable markers and an origin of replication to ensure
maintenance of the vector and to, if desirable, provide amplification within
the host. Suitable prokaryotic hosts for transformation include E. coli,
Bacillus subtilis, Salmonella typhimurium and various species within the
genera Pseudomonas, Streptomyces, and Staphylococcus, although others
may also be employed as a matter of choice.

As a representative but nonlimiting example, useful expression
vectors for bacterial use can comprise a selectable marker and bacterial
origin of replication derived from commercially available plasmids comprising
genetic elements of the well known cloning vector pBR322 (ATCC 37017).
Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine
Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI,
USA). These pBR322 "backbone" sections are combined with an
appropriate promoter and the structural sequence to be expressed.

Following transformation of a suitable host strain and growth of the
host strain to an appropriate cell density, the selected promoter is induced
by appropriate means (e.g., temperature shift or chemical induction) and
cells are cultured for an additional period.

Cells are typically harvested by centrifugation, disrupted by physical
or chemical means, and the resulting crude extract retained for further
purification.

Microbial cells employed in expression of proteins can be disrupted
by any convenient method, including freeze-thaw cycling, sonication,
mechanical disruption, or use of cell lysing agents; such methods are well
known to those skilled in the art.

Various mammalian cell culture systems can also be employed to
express recombinant protein. Examples of mammalian expression systems
include the COS-7 lines of monkey kidney fibroblasts, described by
Gluzman, Cell, 23:175 (1981), and other cell lines capable of expressing a
compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell
lines. Mammalian expression vectors will comprise an origin of replication,
a suitable promoter and enhancer, and also any necessary ribosome binding
sites, polyadenylation site, splice donor and acceptor sites, transcriptional
termination sequences, and 5' flanking nontranscribed sequences. DNA
sequences derived from the SV40 splice and polyadenylation sites may be
used to provide the required nontranscribed genetic elements.

The polypeptide can be recovered and purified from recombinant cell
cultures by methods including ammonium sulfate or ethanol precipitation,
acid extraction, anion or cation exchange chromatography,
phosphocellulose chromatography, hydrophobic interaction chromatography,
affinity chromatography, hydroxylapatite chromatography and lectin
chromatography. Protein refolding steps can be used, as necessary, in
completing configuration of the mature protein. Finally, high performance
liquid chromatography (HPLC) can be employed for final purification steps.

The polypeptides of the present invention may be a naturally purified
product, or a product of chemical synthetic procedures, or produced by
recombinant techniques from a prokaryotic or eukaryotic host (for example,
by bacterial, yeast, higher plant, insect and mammalian cells in culture).
Depending upon the host employed in a recombinant production procedure,
the polypeptides of the present invention may be glycosylated or may be
non-glycosylated. Polypeptides of the invention may also include an initial
methionine amino acid residue.

The polypeptides of the present invention, when utilized for
clinically related purposes, may also be suspended in a pharmacologically
acceptable diluent or excipient to facilitate such uses, which will include
use as a vaccine for the purpose of preventing a wide variety of
streptococcal and staphylococcal infections.

In accordance with another aspect of the present invention, there is
provided a vaccine that includes at least one polypeptide that is at least
75% identical, preferably at least 90% identical and most preferably at least
95% identical, to a polypeptide sequence comprising the sequence of SEQ ID
NO: 2, 4, or 6. Such variations in homology for putative vaccines are well
known in the art (See, for example, Hanson et al., "Active and Passive
Immunity Against Borrelia burgdorferi Decorin Binding Protein A (DbpA),"
Infection and Immunity, (May) 1998, p. 2143-2153; Roberts et al.,
"Heterogeneity Among Genes Including Decorin Binding Proteins A and B of
Borrelia burgdorferi sensu lato," Infection and Immunity, (Nov) 1998, p.
5275-5285). Such observations would similarly apply to portions, segments
or fragments of the polypeptides disclosed herein.

Such segments find a multitude of uses. For example, such segments
of the polypeptides according to the present invention find use as
intermediates in the synthesis of higher molecular weight structures also
within the present invention.

The term "active fragment" means a fragment that generates an
immune response (i.e., has immunogenic activity) when administered, alone
or optionally with a suitable adjuvant, to an animal, such as a mammal, for
example, a rabbit or a mouse, and also including a human.

In accordance with a further aspect of the invention, a vaccine of the
type hereinabove described is administered for the purpose of preventing or
treating infection caused by streptococci and staphylococci as well as many
related organisms.

A vaccine in accordance with the present invention may include one
or more of the hereinabove described polypeptides or active fragments
thereof. When more than one polypeptide or active fragment is employed,
the two or more polypeptides and/or active fragments may be used as
a physical mixture or as a fusion of two or more polypeptides or active
fragments. The fusion fragment or fusion polypeptide may be produced, for
example, by recombinant techniques or by the use of appropriate linkers for
fusing previously prepared polypeptides or active fragments.

In many cases, the variation in the polypeptide or active fragment is a
conservative amino acid substitution, although other substitutions are within
the scope of the invention.

In accordance with the present invention, a polypeptide variant
includes variants in which one or more amino acids are substituted and/or
deleted and/or inserted.

In another aspect, the invention relates to passive immunity vaccines
formulated from antibodies against a polypeptide or active fragment of a
polypeptide of the present invention. Such passive immunity vaccines can
be utilized to prevent and/or treat streptococcal and staphylococcal
infections in patients. In this manner, according to a further aspect of the
invention, a vaccine can be produced from a synthetic or recombinant
polypeptide of the present invention or an antibody against such
polypeptide.

Still another aspect of the present invention relates to a method of using
one or more antibodies (monoclonal, polyclonal or sera) to the polypeptides
of the invention as described above for the prophylaxis and/or treatment of
diseases that are caused by streptococcal and staphylococcal bacteria. In
particular, the invention relates to a method for the prophylaxis and/or
treatment of infectious diseases that are caused by streptococci and
staphylococci. In a still further preferred aspect, the invention relates to a
method for the prophylaxis and/or treatment of such diseases as necrotizing
fasciitis, scarlet fever, sepsis and many diseases of newborns, in humans
by utilizing a vaccine of the present invention.



Generally, vaccines are prepared as injectables, in the form of
aqueous solutions or suspensions. Vaccines in an oil base, such as for
inhalation, are also well known. Solid forms which are dissolved or suspended
prior to use may also be formulated. Pharmaceutical carriers, diluents and
excipients are generally added that are compatible with the active
ingredients and acceptable for pharmaceutical use. Examples of such
carriers include, but are not limited to, water, saline solutions, dextrose, or
glycerol. Combinations of carriers may also be used.

Vaccine compositions may further incorporate additional substances
to stabilize pH, or to function as adjuvants, wetting agents, or emulsifying
agents, which can serve to improve the effectiveness of the vaccine.

Vaccines are generally formulated for parenteral administration and
are injected either subcutaneously or intramuscularly. Such vaccines can
also be formulated as suppositories or for oral administration, using
methods known in the art, or for administration through nasal or respiratory
routes.

The amount of vaccine sufficient to confer immunity to pathogenic
bacteria is determined by methods well known to those skilled in the art.
This quantity will be determined based upon the characteristics of the
vaccine recipient and the level of immunity required. Typically, the amount
of vaccine to be administered will be determined based upon the judgment
of a skilled physician. Where vaccines are administered by subcutaneous or
intramuscular injection, a range of 0.5 to 500 µg purified protein may be
given.

The present invention is also directed to a vaccine in which a
polypeptide or active fragment of the present invention is delivered or
administered in the form of a polynucleotide encoding the polypeptide or
active fragment, whereby the polypeptide or active fragment is produced in
vivo. The polynucleotide may be included in a suitable expression vector
and combined with a pharmaceutically acceptable carrier.

Thus, the present invention expressly contemplates a vaccine
composition comprising any of the polypeptides disclosed herein, said
polypeptide being present in an amount effective to produce an immune
response, and wherein said polypeptide is suspended in a pharmacologically
acceptable carrier, diluent or excipient.

The vaccine compositions of the present invention may also comprise
live vaccines, containing such organisms as Streptococcus gordonii and
Salmonella typhi, wherein said organisms contain recombinant polypeptides
as disclosed herein.

In addition, the polypeptides of the present invention can be used as
immunogens to stimulate the production of antibodies for use in passive
immunotherapy, for use as diagnostic reagents, and for use as reagents in
other processes such as affinity chromatography.

Thus, the present invention is also directed to methods for the
prevention of a wide variety of diseases caused by streptococcal and
staphylococcal organisms, said methods involving the administering of
vaccines disclosed herein to animals at risk of such diseases, especially
where said animals are humans.

In addition, the invention disclosed herein is also directed to a
means of treating animals, especially humans, afflicted with a disease
caused by the organisms from which the isolated polypeptides of the
invention are derived, such methods including, but not being limited to,
administering to an animal, especially a human, afflicted with such a
disease a therapeutically effective amount of an antibody, or mixture of
antibodies, against the polypeptides disclosed herein.

Antibodies specific for the polypeptides disclosed herein may be
either polyclonal or monoclonal and may even be in the form of antisera.
When such antibodies are monoclonal in nature, they may be produced by
conventional methods of preparing monoclonal antibodies, such as from
conventional hybridoma cells, and may also be produced by genetically
engineered cells transformed with vectors containing genes specifically
coding for the different heavy and light chains of antibody molecules
having an arrangement of variable regions specifically complementary to
one or more of the polypeptides of the invention. Such recombinantly
produced antibodies may be in the form of either dimers or tetramers,
depending on the type of cellular expression system utilized therefor.

The invention will now be further described in more detail in the
following non-limiting examples and it will be appreciated that additional
and different embodiments of the teachings of the present invention will
doubtless suggest themselves to those of skill in the art and such other
embodiments are considered to have been inferred from the disclosure
herein.



Example 1

Southern Blot Analysis of Chromosomal DNA Using Probes Specific for the 
Sp36 Gene of Streptococcus pneumoniae 

Genomic DNA was isolated from Staphylococcus aureus,
Streptococcus pyogenes (group A), and Streptococcus agalactiae (group B)
after overnight growth of the bacteria. The DNA was digested to
completion by overnight incubation with restriction enzymes (BamHI and
PvuII), and then DNA fragments were resolved by size by agarose gel
electrophoresis before transfer to a nylon membrane. The membrane was
then probed with DNA encoding the entire Sp36 open reading frame that
had been fluorescein-labeled with random primers using a kit from
Amersham Pharmacia Biotech Inc. The hybridization and washes were
carried out under low stringency conditions (i.e., 45°C, 5xSSC
hybridization; 45°C, 1xSSC for 1st wash; 45°C, O.BxSSC for 2nd wash).
Here, SSC is composed of 150 mM NaCl and 15 mM sodium citrate, pH 7.0
and all washes are 50 mL each.

After hybridization and washing were complete, the bound,
fluorescein-labeled probe was detected using an anti-fluorescein antibody as
per the manufacturer's instructions with the kit. Similarly digested DNA
from Streptococcus pneumoniae strain SJ2 (serotype 6B) was used as a
positive control. Fluorescein-labeled bacteriophage lambda DNA digested
with the restriction nuclease HindIII was used as a size marker.

The Sp36 probe hybridized with a single fragment in the digested S.
aureus DNA (~4.5 kb BamHI fragment, ~5 kb PvuII fragment) and with 2
major fragments in a PvuII digest of serotype M1 of the group A
streptococci genomic DNA (~4.0 kb and ~4.2 kb).
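
As an illustrative aside that is not part of the original disclosure, the salt concentrations of the hybridization and first-wash buffers in this example can be worked out by scaling the 1xSSC composition. The sketch below assumes the standard 1xSSC composition of 150 mM NaCl and 15 mM sodium citrate; the names SSC_1X_MM and ssc_concentrations are hypothetical and chosen only for this illustration.

```python
# Illustrative sketch only: scales 1xSSC to the strengths named in Example 1.
# Assumption: 1xSSC = 150 mM NaCl and 15 mM sodium citrate (standard recipe).

SSC_1X_MM = {"NaCl": 150.0, "sodium citrate": 15.0}


def ssc_concentrations(fold):
    """Return the millimolar concentration of each component at `fold` x SSC."""
    return {name: conc * fold for name, conc in SSC_1X_MM.items()}


# Hybridization buffer and first wash, as named in the example text.
for label, fold in [("hybridization (5xSSC)", 5.0), ("1st wash (1xSSC)", 1.0)]:
    print(label, ssc_concentrations(fold))
```

Lower salt concentration corresponds to higher stringency, so each successive wash in a series of decreasing SSC strength removes more weakly bound probe.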

Example 2

BLAST Analysis Using Sp36 Predicted Amino Acid Sequence 

Sequence comparisons of the Sp36 encoded protein sequence
against the publicly available GenBank sequence database (including the
unfinished microbial database
(http://www.ncbi.nlm.nih.gov/BLAST/unfinishedgenome.html)) revealed two
highly homologous amino acid sequences. One of these was a predicted
amino acid sequence from the S. pyogenes genome. This predicted
polypeptide comprised 825 amino acid residues (MW = 92,616 Da) and
was 25.1% identical to the Sp36 amino acid sequence from pneumococcus
serotype 4 but maintained the 5 histidine triads (underlined in Figure 5(a) -
SEQ ID NO: 2). The second polypeptide encoded within the S. pyogenes
database contained several errors that were corrected by our sequencing of
this region of the genome. The DNA fragment obtained encoded a protein of
792 amino acids (MW = 87,457 Da) that was 12.6% identical to the
pneumococcal sequence and 12.5% identical to the first S. pyogenes
polypeptide. This predicted amino acid sequence contained four histidine
triad motifs (underlined in Fig. 5(b) - SEQ ID NO: 4). The third polypeptide
sequence obtained was one already in the database (Accession No.
AF062533) and identified only as an unknown gene downstream from a
gene identified as lmb in S. agalactiae. This 822 amino acid protein has
a predicted molecular weight of 92,353 Da and maintains the 5 histidine
triad motifs (underlined in Figure 5(c) - SEQ ID NO: 6). This third
polypeptide shows 25.6% sequence identity to Sp36 of pneumococcus
type 4 and 97.7% and 11.6% identity to the two group A homologs,
respectively.
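
By way of illustration only, a scan for histidine triads in a predicted amino acid sequence might be sketched as follows. The disclosure does not spell out the motif pattern; the sketch assumes the pattern H-x-x-H-x-H (x being any residue), and the names HIS_TRIAD and find_histidine_triads are hypothetical. The test fragment is residues 81-96 of SEQ ID NO: 2 written in one-letter code.

```python
import re

# Assumption: a "histidine triad" is modeled as H-x-x-H-x-H, x = any residue.
# The lookahead lets overlapping occurrences be reported as well.
HIS_TRIAD = re.compile(r"(?=H..H.H)")


def find_histidine_triads(protein, offset=1):
    """Return start positions of H-x-x-H-x-H matches.

    `offset` is the sequence position of the first character of `protein`,
    so positions are 1-based by default.
    """
    return [m.start() + offset for m in HIS_TRIAD.finditer(protein)]


# Residues 81-96 of SEQ ID NO: 2 (Tyr Val Thr Ser His Gly Asp His Tyr His ...).
fragment = "YVTSHGDHYHFYNGKV"
print(find_histidine_triads(fragment, offset=81))  # prints [85]
```

Applied to a full-length sequence with the default offset, the function would return the position of every such motif, which can then be compared against the underlined triads in the figures.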

Example 3

Southern Blot Analysis Using a group A Streptococcal Sp36 Homolog Probe 

Southern blot analysis was performed with a fluorescein-labeled DNA
fragment as probe, which encoded a group A streptococcal Sp36 homolog
cloned from an M1 serotype of the group A streptococcal genome. This
fragment was then used to probe genomic DNA from an M6 serotype of the
group A streptococcal genome, as well as serotype 1a and serotype 3 of
the group B streptococcal genome, and strain SJ2 (serotype 6B) of
pneumococcus. In all cases, a single band was obtained in DNA digested
with BamHI when hybridization was carried out under low stringency
conditions (as described above). A band of about 20 kb was visualized in
group A streptococcal DNA, a band of about 4.5 kb was obtained for group B
streptococcal DNA, and a band of about 4 kb was seen for pneumococcus.



Example 4 

Western Blot Analysis of Reactivity of group B Streptococcal Homolog With 
Anti-Pneumococcal Sp36 Antiserum 

To determine whether antiserum raised against recombinant Sp36
from S. pneumoniae would recognize the recombinant Sp36 homolog
encoded by group B streptococcal organisms, a western blot was
performed. One hundred nanograms (100 ng) of recombinant Sp36
polypeptide cloned from S. pneumoniae serotype 4, of the Sp36
homolog cloned from group B streptococcal organisms, or of an unrelated
recombinant protein control expressed and purified in the same way, were
subjected to SDS-PAGE on a gel containing 12% acrylamide. A cell lysate of
pneumococcal strain SJ2 (serotype 6B) was also included on the gel. After
electrophoresis, the separated proteins were transferred to a nitrocellulose
membrane and probed with rabbit polyclonal antiserum raised against the
recombinant pneumococcal protein. Bound antibodies were detected
chemiluminescently with a goat anti-rabbit IgG antibody conjugated to
horseradish peroxidase using the substrate ECL (from Amersham). The
results demonstrate that antiserum raised against the pneumococcal Sp36
protein cross-reacts with the Sp36 homolog identified from the group B
streptococci, thereby indicating conservation of epitopes between the
proteins. The group B streptococcal homolog is also approximately the same
size as the protein detected in S. pneumoniae lysates. Because the group A
and B homologs are highly homologous, if not identical, such antiserum
would also likely cross-react with the group A streptococcal protein.

Example 5 

Alignment of Predicted Amino Acid Sequences of the Sp36 Homologs from 
group A and B Streptococci With Pneumococcal Sp36 

The predicted amino acid sequences from the Sp36 genes from
group A and group B streptococci and S. pneumoniae were aligned using
the Clustal algorithm in a DNAStar computer package (DNAStar, Inc.,
Madison, WI). Amino acids that match those encoded by the pneumococcal
gene are boxed in Figure 2 (showing the results of the alignment). Gaps
introduced in the sequence by the alignment process are indicated by
dashed lines.
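
As a rough illustration of how a percent identity figure can be derived from two aligned sequences containing gap characters, consider the minimal sketch below. It is not the DNAStar implementation: the function name percent_identity is hypothetical, the choice to exclude gapped columns from the denominator is an assumption (other tools use different denominators), and the two short aligned strings are made up for the example.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of gap-free aligned columns in which the residues match.

    Assumption: columns where either sequence has a gap are excluded from
    both the match count and the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Made-up aligned fragments (one-letter code, "-" marks an alignment gap).
print(percent_identity("HGDHYH-FY", "HGNHYHAFW"))  # prints 75.0
```

In the toy input, eight columns are gap-free and six of them match, giving 75.0%.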

Example 6

Percentage Sequence Identity Between Homologs of Sp36

The Sp36 amino acid sequence from pneumococci is 25.6% identical
to the predicted amino acid sequence of the homologous gene of group B
streptococci and 25.1% and 12.6% identical to the deduced sequences of
the two genes from group A streptococci. Furthermore, the group B
homolog is 97.7% and 11.6% identical to the first (GAS36) and second
(GAS36(2)) homologs from group A streptococci, respectively. These
experiments indicate that homologous genes to Sp36 from pneumococcus
are present in group A and group B streptococci, as well as in
Staphylococcus aureus. The protein encoded by this gene may therefore
perform a similar function in these different organisms. This suggests that a
vaccine comprising one or more of these proteins may be broadly protective
against these species. These results are summarized in Table 1 which
shows the percent identity between the amino acid sequences of Sp36
from pneumococcus strain Norway 4 (serotype 4), group A streptococci
Sp36 homolog from an M1 serotype, and group B streptococci Sp36 from
strain R268.



Table 1.

               Pneumo. Sp36   GAS36    GAS36(2)   GBS36
Pneumo. Sp36   100%           25.1%    12.6%      25.6%
GAS36                         100%                97.7%
GAS36(2)                               100%       11.6%
GBS36                                             100%

where GAS36 = SEQ ID NO: 2
      GAS36(2) = SEQ ID NO: 4
      GBS36 = SEQ ID NO: 6

WHAT IS CLAIMED IS:

1. An isolated polypeptide comprising an amino acid sequence at
least 75% identical to a sequence selected from the group consisting of
SEQ ID NO: 2, 4 and 6.

2. The isolated polypeptide of claim 1 wherein said polypeptide is at
least 90% identical to the sequence selected from the group consisting of
SEQ ID NO: 2, 4, and 6.

3. The isolated polypeptide of claim 1 wherein said polypeptide is at
least 95% identical to the sequence selected from the group consisting of
SEQ ID NO: 2, 4, and 6.

4. The isolated polypeptide of claim 1 wherein said polypeptide has
the amino acid sequence selected from the group consisting of SEQ ID NO:
2, 4 and 6.

5. The isolated polypeptide of claim 1 wherein said polypeptide is
found in an organism selected from the group consisting of group A
streptococci, group B streptococci, and Staphylococcus aureus.

6. The isolated polypeptide of claim 5 wherein the group A
streptococcal organism is Streptococcus pyogenes.

7. The isolated polypeptide of claim 5 wherein the group B
streptococcal organism is Streptococcus agalactiae.

8. The isolated polypeptide of claim 1 wherein said polypeptide has a
sequence at least 25% identical to the amino acid sequence of the Sp36
protein of Streptococcus pneumoniae.

9. An isolated polynucleotide comprising a sequence coding for a
polypeptide selected from the group consisting of the polypeptides of claims
1, 2, 3, 4, 5, 6, 7, and 8.

10. The isolated polynucleotide of claim 9 wherein said
polynucleotide has a nucleotide sequence selected from the group
consisting of SEQ ID NO: 1, 3 and 5.

11. An antibody specific for a polypeptide selected from the group
consisting of the polypeptides of claims 1, 2, 3, 4, 5, 6, 7, and 8.

12. The antibody of claim 11 wherein said antibody is a monoclonal
antibody.

13. A genetically engineered cell producing the antibody of claim 12.

14. A vector comprising the polynucleotide of claim 9.

15. A vector comprising the polynucleotide of claim 10.

16. A genetically engineered cell expressing the polypeptide coded
for by the polynucleotide of claim 9 or 10.

17. A composition comprising a polypeptide selected from the group
consisting of the polypeptides of claims 1, 2, 3, 4, 5, 6, 7, and 8, said
polypeptide being suspended in a pharmacologically acceptable diluent or
excipient.

18. A vaccine composition comprising a polypeptide selected from
the group consisting of the polypeptides of claims 1, 2, 3, 4, 5, 6, 7, and 8,
said polypeptide being present in an amount effective to produce an
immune response, and wherein said polypeptide is suspended in a
pharmacologically acceptable carrier, diluent or excipient.

19. A vaccine comprising an immunogenically active amount of the
composition of claim 17.

20. A method of vaccinating an animal against infection by a
bacterial organism selected from the group consisting of streptococcal
bacteria and staphylococcal bacteria comprising administering to said animal
an immunologically effective amount of the vaccine of claim 19.

21. The method of claim 20 wherein said animal is a human.

22. A method of treating a disease comprising administering to an
animal afflicted therewith a therapeutically effective amount of an
antibody of claim 12 wherein said antibody is suspended in a
pharmacologically acceptable carrier, diluent or excipient.

23. The method of claim 22 wherein said animal is a human.

24. The method of claim 22 wherein said disease is caused by an
organism selected from the group consisting of group A streptococci, group
B streptococci, and Staphylococcus aureus.

Figure 1
Figure 2(a)
Figure 2(b)
Figure 2(c)
Figure 3
Figure 4

SEQUENCE LISTING

<110> Heinrichs, Jon
      Johnson, Leslie S.
      Koenig, Scott
      Adamou, John E.

<120> Pneumococcal Protein Homologs and Fragments for
      Vaccines

<130> 469201-402 

<140> 
<141> 

<150> U.S. 60/150,750 
<151> 1999-08-25 

<160> 6 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 2478 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 1 

gtgaagaaaa catatggtta tatcggctca gttgctgcta ttttactagc tactcatatt 60
ggaagttacc aacttggtaa gcatcatatg ggttcagcaa caaaggacaa tcaaattgcc 120
tatattgatg atagcaaagg taaggcaaaa gcccctaaaa caaacaaaac gatggatcaa 180
atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggc 240
tatgtgacct cacatggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 300
attattagtg aagagttgtt gatgacggat cctaattacc gttttaaaca atcagacgtt 360
atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 420
aagccaggta gcaagcgcaa aaacattcga accaaacaac aaattgctga gcaagtagcc 480
aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 540
gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 600 
atttttagtc cgacagatat cattgatgat ttaggagatg cttatttagt acctcatggt 660 
aatcactatc attatattcc taaaaaggat ttgtctccaa gtgagctagc tgctgcacaa 720 
gcctactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 780 
gccccagccc caggtcgtag gaaagcccca attcctgatg tgacgcctaa ccctggacaa 840
ggtcatcagc cagataacgg tggctatcat ccagcgcctc ctaggccaaa tgatgcgtca 900
caaaacaaac accaaagaga tgagtttaaa ggaaaaacct ttaaggaact tttagatcaa 960
ctacaccgtc ttgatttgaa ataccgtcat gtggaagaag atgggttgat ttttgaaccg 1020
actcaagtga tcaaatcaaa cgcttttggg tatgtggtgc ctcatggaga tcattatcat 1080
attatcccaa gaagtcagtt atcacctctt gaaatggaat tagcagatcg atacttagcc 1140
ggccaaactg aggacgatga ctcaggttca gatcactcaa aaccatcaga taaagaagtg 1200
acacatacct ttcttggtca tcgcatcaaa gcttacggaa aaggcttaga tggtaaacca 1260
tatgatacga gtgatgctta tgtttttagt aaagaatcca ttcattcagt ggataaatca 1320
ggagttacag ctaaacacgg agatcatttc cactatatag gatttggaga acttgaacaa 1380
tatgagttgg atgaggtcgc taactgggtg aaagcaaaag gtcaagctga tgagcttgct 1440
gctgctttgg atcaggaaca aggcaaagaa aaaccactct ttgacactaa aaaagtgagt 1500 
cgcaaagtaa caaaagatgg taaagtgggc tatatgatgc caaaagatgg caaggactat 1560 
ttctatgctc gtgatcaact tgatttgact cagattgcct ttgccgaaca agaactaatg 1620 
cttaaagata agaaacatta ccgttatgac attgttgaca caggtattga gccacgactt 1680 
gctgtagatg tgtcaagtct gccgatgcat gctggtaatg ctacttacga tactggaagt 1740 
tcgtttgtta tccctcatat tgatcatatc catgtcgttc cgtattcatg gttgacgcgc 1800 
gatcagattg caacaatcaa gtatgtgatg caacaccccg aagttcgtcc ggatatatgg 1860 
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tctaagccag ggcatgaaga gtcaggttcg gtcattccaa atgttacgcc tcttgataaa 1920 
cgtgctggta tgccaaactg gcaaattatc cattctgctg aagaagttca aaaagcccta 198 0 
gcagaaggtc gttttgcaac accagacggc tatattttcg atccacgaga tgttttggcc 2 04 0 
aaagaaactt ttgtatggaa agatggctcc tttagcatcc caagagcaga tggcagttca 2100 
ttgagaacca ttaataaatc tgatctatcc caagctgagt ggcaacaagc tcaagagtta 2160
ttggcaaaga aaaacgctgg tgatgctact gatacggata aacccaaaga aaagcaacag 2220
gcagataaga gcaatgaaaa ccaacagcca agtgaagcca gtaaagaaga agaaaaagaa 2280
tcagatgact ttatagacag tttaccagac tatggtctag atagagcaac cctagaagat 2340
catatcaatc aattagcaca aaaagctaat atcgatccta agtatctcat tttccaacca 2400
gaaggtgtcc aattttataa taaaaatggt gaattggtaa cttatgatat caagacactt 2460
caacaaataa acccttaa 2478 



<210> 2 

<211> 825 

<212> PRT 

<213> Streptococcus pyogenes 



<400> 2 

Val Lys Lys Thr Tyr Gly Tyr Ile Gly Ser Val Ala Ala Ile Leu Leu
1 5 10 15

Ala Thr His Ile Gly Ser Tyr Gln Leu Gly Lys His His Met Gly Ser
20 25 30

Ala Thr Lys Asp Asn Gln Ile Ala Tyr Ile Asp Asp Ser Lys Gly Lys
35 40 45

Ala Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gln Ile Ser Ala Glu
50 55 60

Glu Gly Ile Ser Ala Glu Gln Ile Val Val Lys Ile Thr Asp Gln Gly
65 70 75 80

Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val
85 90 95

Pro Tyr Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Thr Asp Pro Asn
100 105 110

Tyr Arg Phe Lys Gln Ser Asp Val Ile Asn Glu Ile Leu Asp Gly Tyr
115 120 125

Val Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser
130 135 140

Lys Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile Ala Glu Gln Val Ala
145 150 155 160

Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gln Val Ala His
165 170 175

Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys Arg Gln Gly
180 185 190

Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser Pro Thr Asp Ile Ile
195 200 205

Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn His Tyr His
210 215 220

Tyr Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala Ala Ala Gln
225 230 235 240

Ala Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala Arg Pro Ser Asp Tyr
245 250 255

Arg Pro Thr Pro Ala Pro Ala Pro Gly Arg Arg Lys Ala Pro Ile Pro
260 265 270

Asp Val Thr Pro Asn Pro Gly Gln Gly His Gln Pro Asp Asn Gly Gly
275 280 285

Tyr His Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln Asn Lys His
290 295 300

Gln Arg Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp Gln
305 310 315 320

Leu His Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly Leu
325 330 335

Ile Phe Glu Pro Thr Gln Val Ile Lys Ser Asn Ala Phe Gly Tyr Val
340 345 350

Val Pro His Gly Asp His Tyr His Ile Ile Pro Arg Ser Gln Leu Ser
355 360 365

Pro Leu Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr Glu
370 375 380

Asp Asp Asp Ser Gly Ser Asp His Ser Lys Pro Ser Asp Lys Glu Val
385 390 395 400

Thr His Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly Leu
405 410 415

Asp Gly Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys Glu
420 425 430

Ser Ile His Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly Asp
435 440 445

His Phe His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu Asp
450 455 460

Glu Val Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu Ala
465 470 475 480

Ala Ala Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp Thr
485 490 495

Lys Lys Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Met
500 505 510

Met Pro Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gln Leu Asp
515 520 525

Leu Thr Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp Lys
530 535 540

Lys His Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg Leu
545 550 555 560

Ala Val Asp Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr Tyr
565 570 575

Asp Thr Gly Ser Ser Phe Val Ile Pro His Ile Asp His Ile His Val
580 585 590

Val Pro Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr Ile Lys Tyr
595 600 605

Val Met Gln His Pro Glu Val Arg Pro Asp Ile Trp Ser Lys Pro Gly
610 615 620

His Glu Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp Lys
625 630 635 640

Arg Ala Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu Val
645 650 655

Gln Lys Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr Ile
660 665 670

Phe Asp Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp
675 680 685

Gly Ser Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr Ile
690 695 700

Asn Lys Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu Leu
705 710 715 720

Leu Ala Lys Lys Asn Ala Gly Asp Ala Thr Asp Thr Asp Lys Pro Lys
725 730 735

Glu Lys Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser Glu
740 745 750

Ala Ser Lys Glu Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp Ser Leu
755 760 765

Pro Asp Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln
770 775 780

Leu Ala Gln Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro
785 790 795 800

Glu Gly Val Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp
805 810 815

Ile Lys Thr Leu Gln Gln Ile Asn Pro
820 825



<210> 3 
<211> 2379
<212> DNA 

<213> Streptococcus pyogenes 
<400> 3 

atgaaaacga aaaaagttat tattttagtt ggtctattgt tatcatctca gttgactttg 60
atagcttgtc aatcacgagg taatggtaca tatcccatta aaacgaaaca atcacgtaag 120
ggaatgacgt caaacaaaat taaaccgatt aaaaaaagca aaaagacaaa caagactcac 180
aaaggtgtgg cgggtgtcga ttttcctaca gatgatgggt ttattttaac caaagactca 240
aaaatcttat caaaaacaga tcagggaatc gttgttgacc atgatggtca ttcgcatttt 300
attttttatg ccgatttaaa gggaagtcca tttgaatacc ttattccaaa aggagcaagt 360
ttagctaagc cagctgttgc tcagcgagca gctagtcaag ggacttctaa agtagcagat 420
cctcatcacc attatgaatt taacccagcg gatattgtgg ctgaagatgc tttaggctac 480
acggttcgcc acgatgatca cttccattat attttgaagt caagcttatc aggtcagaca 540
caggcacaag ctaaacaggt tgctactcgc ttgccacaaa ccagtagcct tgtttcaaca 600
gctacagcta atggtattcc aggcttgcat ttcccaacct cagatggttt tcaatttaac 660
ggtcaaggta ttgttggggt aacaaaagac agtattttag tggaccacga tggtcactta 720
catcctattt cttttgcgga ccttcgtcag ggtggctggg cacatgtggc agatcaatac 780
gatcccgcta aaaaagcaga aaagccagca gaaacccatc agacaccaga gctatctgaa 840
cgtgaaaagg aataccaaga aaaattagct tatttggcag aaaaattggg gattgatcca 900
tcaactatta aacgtgtgga aacacaagac ggtaaacttg gtttggaata ccctcaccat 960
gaccacgcac acgtattgat gttatctgat attgaaatcg gaaaagacat tccagatcca 1020
catgctattg agcatgcccg tgaattggaa aaacataagg ttggaatgga taccttgcgt 1080
gccttagggt ttgatgaaga agtgattttg gatatcgttc gcactcacga tgctccaacc 1140
ccattcccat caaatgaaaa agatccgaat atgatgaaag aatggttagc aacggttatc 1200
aaacttgact tgggcagccg taaagatcct ttgcaacgta aaggactttc actgttaccc 1260
aacttagaaa ctttaggaat tggctttaca ccaatcaaag atatctcacc tgttttgcaa 1320
tttaaaaaat tgaaacagtt gttaatgaca aaaacagggg tgactgatta tagatttttg 1380
gataatatgc cacagttaga aggcattgat atttcacaaa acaatctcaa agatattagt 1440
ttcttgagca aatataaaaa cttaactcta gtagcggctg ctgataatgg tattgaagat 1500
attaggccgc ttggtcaatt accaaatctc aaattcctcg tattgagtaa caataagatt 1560
tctgatttaa gcccactggc atcgttacat caattgcaag aattgcacat tgataataat 1620
cagattacag atttaagccc tgtttctcat aaagaatcat tgacggttgt tgatttatca 1680
agaaatgctg atgttgactt agcaacactt caagcaccca aattagaaac gttaatggtc 1740
aatgatacca aggtttctca tttggatttc ttgaaaaata atcctaatct atctagccta 1800
tctattaacc gtgcgcaatt gcaatctctt gaaggtattg aagcaagtag cgtcattgtc 1860
agagtagaag cagaaggtaa ccaaattaaa tcgcttgtgc ttaaagacaa gcaagggtca 1920
cttactttct tggatgtgac aggcaaccag ttgacttctc tagaaggtgt taataatttt 1980
acagcacttg acattttaag cgtgtctaaa aaccaattaa caaatgtcaa cctatctaaa 2040
cccaataaga cagttactaa cattgatatt agtcataaca atatctcatt agcagacctt 2100
aaattgaacg agcaacatat tccagaagcc attgcgaaaa acttcccagc ggtttacgaa 2160
ggttctatgg taggtaatgg aacagctgaa gaaaaagcag ctatggctac taaggcgaaa 2220
gaaagtgctc aagaagcatc ggaatcacat gactacaacc ataatcatac ctatgaagat 2280
gaagaaggtc atgctcacga gcacagagac aaagatgatc acgaccatga acatgaggat 2340
gaaaatgaag ctaaagatga gcaaaaccat gctgactaa 2379



<210> 4 
<211> 792 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 4 

Met Lys Thr Lys Lys Val Ile Ile Leu Val Gly Leu Leu Leu Ser Ser
1 5 10 15

Gln Leu Thr Leu Ile Ala Cys Gln Ser Arg Gly Asn Gly Thr Tyr Pro
20 25 30

Ile Lys Thr Lys Gln Ser Arg Lys Gly Met Thr Ser Asn Lys Ile Lys
35 40 45

Pro Ile Lys Lys Ser Lys Lys Thr Asn Lys Thr His Lys Gly Val Ala
50 55 60

Gly Val Asp Phe Pro Thr Asp Asp Gly Phe Ile Leu Thr Lys Asp Ser
65 70 75 80

Lys Ile Leu Ser Lys Thr Asp Gln Gly Ile Val Val Asp His Asp Gly
85 90 95

His Ser His Phe Ile Phe Tyr Ala Asp Leu Lys Gly Ser Pro Phe Glu
100 105 110

Tyr Leu Ile Pro Lys Gly Ala Ser Leu Ala Lys Pro Ala Val Ala Gln
115 120 125

Arg Ala Ala Ser Gln Gly Thr Ser Lys Val Ala Asp Pro His His His
130 135 140

Tyr Glu Phe Asn Pro Ala Asp Ile Val Ala Glu Asp Ala Leu Gly Tyr
145 150 155 160

Thr Val Arg His Asp Asp His Phe His Tyr Ile Leu Lys Ser Ser Leu
165 170 175

Ser Gly Gln Thr Gln Ala Gln Ala Lys Gln Val Ala Thr Arg Leu Pro
180 185 190

Gln Thr Ser Ser Leu Val Ser Thr Ala Thr Ala Asn Gly Ile Pro Gly
195 200 205

Leu His Phe Pro Thr Ser Asp Gly Phe Gln Phe Asn Gly Gln Gly Ile
210 215 220

Val Gly Val Thr Lys Asp Ser Ile Leu Val Asp His Asp Gly His Leu
225 230 235 240

His Pro Ile Ser Phe Ala Asp Leu Arg Gln Gly Gly Trp Ala His Val
245 250 255

Ala Asp Gln Tyr Asp Pro Ala Lys Lys Ala Glu Lys Pro Ala Glu Thr
260 265 270

His Gln Thr Pro Glu Leu Ser Glu Arg Glu Lys Glu Tyr Gln Glu Lys
275 280 285

Leu Ala Tyr Leu Ala Glu Lys Leu Gly Ile Asp Pro Ser Thr Ile Lys
290 295 300

Arg Val Glu Thr Gln Asp Gly Lys Leu Gly Leu Glu Tyr Pro His His
305 310 315 320

Asp His Ala His Val Leu Met Leu Ser Asp Ile Glu Ile Gly Lys Asp
325 330 335

Ile Pro Asp Pro His Ala Ile Glu His Ala Arg Glu Leu Glu Lys His
340 345 350

Lys Val Gly Met Asp Thr Leu Arg Ala Leu Gly Phe Asp Glu Glu Val
355 360 365

Ile Leu Asp Ile Val Arg Thr His Asp Ala Pro Thr Pro Phe Pro Ser
370 375 380

Asn Glu Lys Asp Pro Asn Met Met Lys Glu Trp Leu Ala Thr Val Ile
385 390 395 400

Lys Leu Asp Leu Gly Ser Arg Lys Asp Pro Leu Gln Arg Lys Gly Leu
405 410 415

Ser Leu Leu Pro Asn Leu Glu Thr Leu Gly Ile Gly Phe Thr Pro Ile
420 425 430

Lys Asp Ile Ser Pro Val Leu Gln Phe Lys Lys Leu Lys Gln Leu Leu
435 440 445

Met Thr Lys Thr Gly Val Thr Asp Tyr Arg Phe Leu Asp Asn Met Pro
450 455 460

Gln Leu Glu Gly Ile Asp Ile Ser Gln Asn Asn Leu Lys Asp Ile Ser
465 470 475 480

Phe Leu Ser Lys Tyr Lys Asn Leu Thr Leu Val Ala Ala Ala Asp Asn
485 490 495

Gly Ile Glu Asp Ile Arg Pro Leu Gly Gln Leu Pro Asn Leu Lys Phe
500 505 510

Leu Val Leu Ser Asn Asn Lys Ile Ser Asp Leu Ser Pro Leu Ala Ser
515 520 525

Leu His Gln Leu Gln Glu Leu His Ile Asp Asn Asn Gln Ile Thr Asp
530 535 540

Leu Ser Pro Val Ser His Lys Glu Ser Leu Thr Val Val Asp Leu Ser
545 550 555 560

Arg Asn Ala Asp Val Asp Leu Ala Thr Leu Gln Ala Pro Lys Leu Glu
565 570 575

Thr Leu Met Val Asn Asp Thr Lys Val Ser His Leu Asp Phe Leu Lys
580 585 590

Asn Asn Pro Asn Leu Ser Ser Leu Ser Ile Asn Arg Ala Gln Leu Gln
595 600 605

Ser Leu Glu Gly Ile Glu Ala Ser Ser Val Ile Val Arg Val Glu Ala
610 615 620

Glu Gly Asn Gln Ile Lys Ser Leu Val Leu Lys Asp Lys Gln Gly Ser
625 630 635 640

Leu Thr Phe Leu Asp Val Thr Gly Asn Gln Leu Thr Ser Leu Glu Gly
645 650 655

Val Asn Asn Phe Thr Ala Leu Asp Ile Leu Ser Val Ser Lys Asn Gln
660 665 670

Leu Thr Asn Val Asn Leu Ser Lys Pro Asn Lys Thr Val Thr Asn Ile
675 680 685

Asp Ile Ser His Asn Asn Ile Ser Leu Ala Asp Leu Lys Leu Asn Glu
690 695 700

Gln His Ile Pro Glu Ala Ile Ala Lys Asn Phe Pro Ala Val Tyr Glu
705 710 715 720

Gly Ser Met Val Gly Asn Gly Thr Ala Glu Glu Lys Ala Ala Met Ala
725 730 735

Thr Lys Ala Lys Glu Ser Ala Gln Glu Ala Ser Glu Ser His Asp Tyr
740 745 750

Asn His Asn His Thr Tyr Glu Asp Glu Glu Gly His Ala His Glu His
755 760 765

Arg Asp Lys Asp Asp His Asp His Glu His Glu Asp Glu Asn Glu Ala
770 775 780

Lys Asp Glu Gln Asn His Ala Asp
785 790



<210> 5 
<211> 2469 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 5 

gtgaagaaaa catatggtta tatcggctca gttgctgcta ttttactagc tactcatatt 60
ggaagttacc agcttggtaa gcatcatatg ggtctagcaa caaaggacaa tcagattgcc 120
tatattgatg atagcaaagg taaggtaaaa gcccctaaaa caaacaaaac gatggatcaa 180
atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggt 240
tatgttacct cacacggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 300
attattagtg aagagttgtt gatgacggat cctaattacc attttaaaca atcagacgtt 360
atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 420
aagccaggta gtaagcgcaa aaacattcga accaaacaac aaattgctga gcaagtagcc 480
aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 540
gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 600
atttttagtc cgacagatat cattgatgat ttaggagatg cttatttagt acctcatggt 660
aatcactatc attatattcc taaaaaagat ttgtctccaa gtgagctagc tgctgcacaa 720
gcctactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 780
gccccaggtc gtaggaaagc cccaattcct gatgtgacgc ctaaccctgg acaaggtcat 840
cagccagata acggtggtta tcatccagcg cctcctaggc caaatgatgc gtcacaaaac 900
aaacaccaaa gagatgagtt taaaggaaaa acctttaagg aacttttaga tcaactacac 960
cgtcttgatt tgaaataccg tcatgtggaa gaagatgggt tgatttttga accgactcaa 1020
gtgatcaaat caaacgcttt tgggtatgtg gtgcctcatg gagatcatta tcatattatc 1080
ccaagaagtc agttatcacc acttgaaatg gaattagcag atcgatactt agccggccaa 1140
actgatgaca acgactcagg ttcagatcac tcaaaaccat cagataaaga agtgacacat 1200
acctttcttg gtcatcgcat caaagcttac ggaaaaggct tagatggtaa accatatgat 1260
acgagtgatg cttatgtttt tagtaaagaa tccattcatt cagtggataa atcaggagtt 1320
acagctaaac acggagatca tttccactat ataggatttg gagaacttga acaatatgag 1380
ttggatgagg tcgctaactg ggtgaaagca aaaggtcaag ctgatgagct tgttgctgct 1440
ttggatcagg aacaaggcaa agaaaaacca ctctttgaca ctaaaaaagt gagtcgcaaa 1500
gtaacaaaag atggtaaagt gggctatatt atgccaaaag atggcaagga ctatttctat 1560






WO 01/14421 



PCT/US00/23417 



gctcgttatc aacttgattt gactcagatt gcctttgccg aacaagaact aatgcttaaa 1620 
gataagaagc attaccgtta tgacattgtt gatacaggca ttgagccacg acttgctgta 1680
gatgtgtcaa gtctgccgat gcatgctggt aatgctactt acgatactgg aagttcgttt 1740 
gttatcccac atattgatca tatccatgtc gttccgtatt catggttgac gcgcaatcag 1800 
attgcaacaa tcaagtatgt gatgcaacac cccgaagttc gtccggatgt atggtctaag 1860 
ccagggcatg aagagtcagg ttcggtcatt ccaaatgtta cgcctcttga taaacgtgct 1920
ggtatgccaa actggcaaat tatccattct gctgaagaag ttcaaaaagc cctagcagaa 1980
ggtcgttttg cagcaccaga cggctatatt ttcgatccac gagatgtttt ggcaaaagaa 2040
acttttgtat ggaaagatgg ctcctttagc atcccaagag cagatggcag ttcattgaga 2100
accattaata aatccgatct atcccaagct gagtggcaac aagctcaaga gttattggca 2160
aagaaaaatg ctggtgatgc tactgatacg gataaacctg aagaaaagca acaggcagat 2220
aagagcaatg aaaaccaaca gccaagtgaa gccagtaaag aagaaaaaga atcagatgac 2280
tttatagaca gtttaccaga ctatggtcta gatagagcaa ccctagaaga tcatatcaat 2340
caattagcac aaaaagctaa tatcgatcct aagtatctca ttttccaacc agaaggtgtc 2400
caattttata ataaaaatgg tgaattggta acttatgata tcaagacact tcaacaaata 2460
aacccttaa 2469 



<210> 6 
<211> 822 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 6 

Val Lys Lys Thr Tyr Gly Tyr Ile Gly Ser Val Ala Ala Ile Leu Leu
1 5 10 15

Ala Thr His Ile Gly Ser Tyr Gln Leu Gly Lys His His Met Gly Leu
20 25 30

Ala Thr Lys Asp Asn Gln Ile Ala Tyr Ile Asp Asp Ser Lys Gly Lys
35 40 45

Val Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gln Ile Ser Ala Glu
50 55 60

Glu Gly Ile Ser Ala Glu Gln Ile Val Val Lys Ile Thr Asp Gln Gly
65 70 75 80

Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val
85 90 95

Pro Tyr Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Thr Asp Pro Asn
100 105 110

Tyr His Phe Lys Gln Ser Asp Val Ile Asn Glu Ile Leu Asp Gly Tyr
115 120 125

Val Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser
130 135 140

Lys Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile Ala Glu Gln Val Ala
145 150 155 160

Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gln Val Ala His
165 170 175

Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys Arg Gln Gly
180 185 190

Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser Pro Thr Asp Ile Ile
195 200 205

Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn His Tyr His
210 215 220

Tyr Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala Ala Ala Gln
225 230 235 240

Ala Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala Arg Pro Ser Asp Tyr
245 250 255

Arg Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala Pro Ile Pro Asp Val
260 265 270

Thr Pro Asn Pro Gly Gln Gly His Gln Pro Asp Asn Gly Gly Tyr His
275 280 285

Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln Asn Lys His Gln Arg
290 295 300

Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp Gln Leu His
305 310 315 320

Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly Leu Ile Phe
325 330 335

Glu Pro Thr Gln Val Ile Lys Ser Asn Ala Phe Gly Tyr Val Val Pro
340 345 350

His Gly Asp His Tyr His Ile Ile Pro Arg Ser Gln Leu Ser Pro Leu
355 360 365

Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr Asp Asp Asn
370 375 380

Asp Ser Gly Ser Asp His Ser Lys Pro Ser Asp Lys Glu Val Thr His
385 390 395 400

Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly Leu Asp Gly
405 410 415

Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys Glu Ser Ile
420 425 430

His Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly Asp His Phe
435 440 445

His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu Asp Glu Val
450 455 460

Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu Val Ala Ala
465 470 475 480

Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp Thr Lys Lys
485 490 495

Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Ile Met Pro
500 505 510

Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Tyr Gln Leu Asp Leu Thr
515 520 525

Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp Lys Lys His
530 535 540

Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg Leu Ala Val
545 550 555 560

Asp Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr Tyr Asp Thr
565 570 575

Gly Ser Ser Phe Val Ile Pro His Ile Asp His Ile His Val Val Pro
580 585 590

Tyr Ser Trp Leu Thr Arg Asn Gln Ile Ala Thr Ile Lys Tyr Val Met
595 600 605

Gln His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro Gly His Glu
610 615 620

Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp Lys Arg Ala
625 630 635 640

Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu Val Gln Lys
645 650 655

Ala Leu Ala Glu Gly Arg Phe Ala Ala Pro Asp Gly Tyr Ile Phe Asp
660 665 670

Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp Gly Ser
675 680 685

Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr Ile Asn Lys
690 695 700

Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu Leu Leu Ala
705 710 715 720

Lys Lys Asn Ala Gly Asp Ala Thr Asp Thr Asp Lys Pro Glu Glu Lys
725 730 735

Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser Glu Ala Ser
740 745 750

Lys Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp Ser Leu Pro Asp Tyr
755 760 765

Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln Leu Ala Gln
770 775 780

Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro Glu Gly Val
785 790 795 800

Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp Ile Lys Thr
805 810 815

820